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EXPRESSION MODULATING SEQUENCES 

FIELD OF THE INVENTION 

5 

The present invention relates generally to novel nucleic acid molecules capable of increasing 
expression of nucleotide sequences in eukaryotic cells. The novel nucleic acid molecules of the 
present invention may be used to increase and/or stabilise or otherwise facilitate expression of 
nucleotide sequences resulting in the presence of a translation product or may be used to down 

10 regulate expression by, for example, promoting transcript degradation via mechanisms such as 
co-suppression. The nucleotide sequence of the present invention is referred to herein as an 
"expression modulating sequence" and generally results in the acquisition of a phenotypic trait 
or loss of a phenotypic trait. The expression modulating sequence of the present invention is 
useful inter alia to increase and/or stabilise or otherwise facilitate expression of nucleotide 

15 sequences in eukaryotic cells and in particular the expression of therapeutically, agriculturally and 
economically important transgenes. The expression modulating sequence of the present 
invention may also be used to inhibit, reduce or otherwise down regulate expression of a 
nucleotide sequence such as a eukaryotic gene including a pathogen gene, the expression of 
which, results in an undesired phenotype. 

20 

BACKGROUND OF THE INVENTION 

Recombinant DNA technology is now an integral part of strategies to generate genetically 
modified eukaryotic cells. For example, genetic engineering has been used to develop varieties 
25 of plants with commercially useful traits and to produce mammalian cells which express a 
therapeutically useful gene or to suppress expression of an unwanted gene. Transposons have 
played an important part in the genetic engineering of plant cells and some non-plant cells to 
provide inter alia tagged regions of genomes to facilitate the isolation of genes by recombinant 
DNA techniques. 

30 

The maize transposon Activator (Ac) and its derivative Dissociation (Ds) comprise one of the 
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first transposon systems to be discovered (1,2) and was first used to clone genes by Fedoroff et 
al (3). The behaviour of Ac in maize has been studied extensively and excision occurs in both 
somatic and germline tissue. Studies have highlighted two important features of Ac/Ds for 
tagging. First, the transposition frequency and second, the preference of Ac/Ds for transposition 
5 in linked sites. 

The use of the Ac/Ds system has been hampered by the difficulty of data interpretation due, for 
example, to the high activity of Ac in certain plants and insertions at unlinked sites arising from 
multiple transpositions rather than by a single event from the T-DNA. This problem was 

10 addressed by Jones et al (4), Carroll et al (5) and others where a two component Ac/Ds system 
was developed. In this system, the Ds elements were made by replacing the Ac transposase gene 
with a marker gene thereby rendering it non-autonomous. T-DNA regions of binary vectors 
were constructed by Carroll et al (5) and Scofield et al (6) carrying either a Ds element or a 
stabilised Activator transposase gene (sAc). The Ds element contained a reporter gene (eg. 

15 nos.BAR) which was shown to be inactivated on crossing with plants carrying the sAc (5). This 
is referred to as transgene silencing. It has been shown that transgene silencing is a more general 
phenomenon in transgenic plants (7, 8, 9). Many different types of transgene silencing have now 
been reported in the literature and include: co-suppression of a transgene and a homologous 
endogenous plant gene (10), inactiviation of ectopically located homologous transgenes in 

20 transgenic plants (7), the silencing of transgenes leading to resistance to virus infection (1 1) and 
inactivation of transgenes inserted in maize transposons in transgenic tomato (5). 

Gene silencing undoubtedly reflects mechanisms of great importance in the understanding of 
plant gene regulation. It is of particular importance because it represents a severe obstacle to 
25 stable and high level expression of economically important transgenes (7). 

In work leading up to the present invention, the inventors sought to identify nucleotide sequences 
which might prevent or otherwise reduce gene silencing and to facilitate increased and/or 
stabilized gene expression in eukaryotic cells such as plant cells. In accordance with the present 
30 invention, the subject inventors have now identified and isolated novel nucleotide sequences 
referred to herein as "expression modulating sequences" or "EMSs" which are useful in 
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increasing or stabilizing nucleotide sequence expression in eukaryotic cells such as plant cells. 
Such increased and stabilised nucleotide sequence expression can also lead to the promotion or 
induction of transcript degradation via mechanisms such as co-suppression. Accordingly, the 
EMSs of the present invention may also be used to inhibit, reduce or otherwise down-regulate 
5 expression of target nucleotide sequences. 

SUMMARY OF THE INVENTION 

Throughout this specification, unless the context requires otherwise, the word ,, comprise , \ or 
10 variations such as "comprises" or "comprising", will be understood to imply the inclusion of a 
stated element or integer or group of elements or integers but not the exclusion of any other 
element or integer or group of elements or integers. 

Sequence Identity Numbers (SEQ ID NOs.) for the nucleotide sequences referred to in the 
15 specification are defined following the bibliography. A summary of the SEQ ID NOs is given 
in Table 1. 

One aspect of the present invention provides an isolated nucleic acid molecule comprising a 
sequence of nucleotides which modulates expression of a second nucleotide sequence inserted 
20 proximal to said first mentioned nucleotide sequence. 

More particularly, the present invention is directed to an isolated nucleic acid molecule 
comprising a sequence of nucleotides which increases or enhances expression of a second 
nucleotide sequence inserted within said first mentioned nucleotide sequence. 

25 

Another aspect of the present invention relates to an expression modulating sequence (EMS) 
comprising a sequence of nucleotides which increases or enhances expression of a nucleotide 
sequence inserted adjacent to, within or otherwise proximal to said EMS. 

30 Still another aspect of the present invention contemplates a genetic construct comprising an EMS 
as herein defined and means to facilitate insertion of a nucleotide sequence within, adjacent to 
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or otherwise proximal to said EMS. 

Still yet another aspect of the present invention provides a genetic construct comprising an EMS 
as herein defined and means to facilitate insertion of a nucleotide sequence within, adjacent to 
5 or otherwise proximal with said EMS and operably linked to a promoter. 

Another aspect of the present invention contemplates a method of increasing or stabilizing 
expression of a nucleotide sequence or otherwise preventing or reducing silencing of a nucleotide 
sequence in a eukaryotic cell said method comprising introducing into said eukaryotic cell the 
10 nucleotide sequence flanked by, adjacent to or otherwise proximal to an EMS. 

More particularly, the present invention provides a method of increasing of stabilizing expression 
of a nucleotide sequence or otherwise preventing or reducing silencing of a nucleotide sequence 
in a plant or cells of a plant said method comprising introducing into said plant or plant cells the 
15 nucleotide sequence flanked by, adjacent to or otherwise proximal with an EMS. 

In an alternative embodiment, the present invention provides a method of inhibiting, reducing or 
otherwise down-regulating expression of a nucleotide sequence in a eukaryotic cell, said method 
comprising introducing into said eukaryotic cell the nucleotide sequence flanked by, adjacent to 
20 or otherwise proximal with an EMS. 

More particularly, the present invention is directed to a method of inhibiting, reducing or 
otherwise down-regulating expression of a nucleotide sequence in a plant or cells of a plant said 
method comprising introducing into said plant or plant cells the nucleotide sequence flanked by, 
25 adjacent to or otherwise proximal with an EMS. 

Yet another aspect of the present invention provides a transgenic animal or plant carrying a 
nucleotide sequence flanked by, adjacent to or otherwise proximal with an EMS. 

30 Still a further aspect of the present invention provides an improved transposon tagging system, 
said system comprising a transposable element carrying a nucleotide sequence flanked by, 
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adjacent to or otherwise proximal with an EMS. 
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TABLE 1 
SUMMARY OF SEQ ID NOs. 



SEQ ID NO. DESCRIPTION 

1 Nucleotide sequence of tomato a-amylase gene promoter 

2 Nucleotide sequence of a-amylase gene promoter 

3 Nucleotide sequence of genomic DN A upstream of Dem 
gene followed by Dem cDNA coding sequence. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a diagrammatic representation showing T-DNA regions of binary vectors carrying 
a Ds element (SLJ1561) of the transposable gene (SLJ10512)[5]. The Ds element carries a 
5 nos:BAR gene and is inserted into a nos:SPEC excision marker. The transposon gene sAc is 
linked to a 2':Gus reporter gene. 

Figure 2 is a diagrammatic representation showing an experimental strategy for generating 
tomato lines carrying transposed Ds elements (5). Fl plants heterozygous for both the Ds and 
10 sAc T-DNAs are test-crossed to produce TC X progeny. The TC { progeny are then screened for 
lines carrying a transposed Ds and a reactivated nos:BAR gene. 

Figure 3 is a photographic representation showing expression and silencing of the nos.BAR gene 
in various tomato lines. Seedlings were germinated in the presence of phosphinothricin for 
15 several weeks and then photographed. A. 1561E, B. UQ406, C. Non-transformed (i.e. does not 
carry the nos. BAR gene), D-F. Three tomato lines that carry silent nos.BAR genes. 

Figure 4 is a representation showing methylation of a genetically engineered Ds transposon in 
transgenic tomato. Two separate Southern analyses were conducted on 7 individual genotypes; 

20 genomic DNA was extracted from leaf tissue (5). The restriction enzymes and probes (shaded 
boxes) used are shown on the figure. Lanes: 1. Non transformed (i.e. no Ds or nos. BAR gene), 
2. 1561E which carries an active nos:BAR gene (due to the fact that it has never been exposed 
to the transposase gene), 3-6. Four tomato lines that carry silent nos.BAR genes, 7. UQ406 
which carries an active nos. BAR gene due to insertion of the Ds in the oe-amylase promoter. The 

25 enzymes Sstll (abbreviated Ss) and Notl (abbreviated Nt) are methylation sensitive, whereas 
BstYl (abbreviated Bs) and EcoKL (abbreviated RI) are not. The expected size fragment for 
unmethylated DNA is indicated by the arrow; larger fragments (as in the silent lines) indicate 
methylation of the DNA at the Sstll or Notl sites. 

30 Figure 5 is a representation showing a sequence comparison between the potato cc-amylase 
promoter (15) [SEQ ID NO:2] and the tomato a-amylase promoter [SEQ ID NO:l]. The 
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location of the UQ406 insertion is shown. 

Figure 6 is a representation of a nucleotide sequence [SEQ ID NO:3] of genomic DNA from 
65 1 bp upstream of the Ds insertion in UQ406 to the beginning of the Dem coding sequence, 
5 followed by the Dem cDNA sequence from the ATG start site at base pair 4097. The target 
sequences of the Ds insertion in UQ406 and Dem ATG are underlined. The Dem cDNA 
sequence is shown in italics and underlined. 

Figure 7 is a photographic representation showing a stable mutant and a somatic revertant of 
10 the Dem locus. The seedling at the right in the background is homozygous for the Ds insertion 
in the Dem gene. The stable mutant fails to develop beyond the stage shown in the figure. The 
somatic revertant in the foreground is homozygous for the Ds insertion at the zygotic stage of 
development, but it also inherited a transposase gene which causes Ds excision and reversion of 
the phenotype to wild-type. Somatic revertants are characterized by abnormal cotyledons but 
15 develop a functional shoot meristem due to Ds excision and restoration of Dem function. Each 
somatic revertant represents an independent transposition event. 

Figure 8 is a diagrammatic representation showing an improved transposon tagging strategy 
using Dem as excision marker. The sAc and Ds parent lines are represented by the upper left and 
20 right boxes, respectively. Because the sAc is linked to the dem mutant +7 allele, somatic 
revertants can theoretically occur at about the frequency of 1 out of 4 in the Fl progeny. Each 
somatic revertant represents an independent transposition event. Chr4, chromosome 4 of 
tomato. 

25 Figure 9 is a diagrammatic representation showing plant expression vector pZorz carrying 
Osa:Luc(\2). 
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DET AILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is predicated in part on the elucidation of the molecular basis of 
transposase-mediated silencing of genetic material located within a transposable element. 
5 Although, in accordance with the present invention, the molecular basis of gene silencing has 
been determined with respect to plant selectable marker genes within the Ds element of the 
Ds/Ac maize transposon system, the present invention clearly extends to the silencing of any 
nucleotide sequence and in particular a transgene and to mechanisms for alleviating gene 
silencing. In accordance with the present invention, nucleotide sequences have been identified 
10 which alleviate gene silencing and which increase or stabilise expression of genetic material. 
Furthermore although the present invention is particularly exemplified in relation to plants, it 
extends to all eukaryotic cells such as cells from mammals, insects, yeasts, reptiles and birds. 

Accordingly, one aspect of the present invention provides an isolated nucleic acid molecule 
15 comprising a sequence of nucleotides which increases or stabilizes expression of a second 
nucleotide sequence inserted proximal to said first mentioned nucleotide sequence. 

The term "proximal" is used in its most general sense to include the position of the second 
nucleotide sequence near, close to or in the genetic vicinity of the first mentioned nucleotide 

20 sequence. More particularly, the term "proximal" is taken herein to mean that the second 
nucleotide sequence precedes, follows or is flanked by the first mentioned nucleotide sequence. 
Preferably, the second nucleotide sequence is within the first mentioned nucleotide sequence and, 
hence, is flanked by portions of the first nucleotide sequence. Generally, the second nucleotide 
sequence is flanked by up to about 10 kb either side of first mentioned nucleotide sequence, more 

25 preferably up to about 5 kb, even more preferably to about 4 kb either side of said first 
mentioned nucleotide sequence and even more preferably to about 10 bp to about 1 kb. 

Accordingly, another aspect of the present invention is directed to an isolated nucleic acid 
molecule comprising a sequence of nucleotides which stabilises, increases or enhances expression 
30 of a second nucleotide sequence inserted into, flanked by, adjacent to or otherwise proximal to 
the said first mentioned nucleotide sequence. 
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The term "expression" is conveniently determined in terms of desired phenotype. Accordingly, 
the expression of a nucleotide sequence may be determined by a measurable phenotypic change 
involving transcription and translation into a proteinaceous product which in turn has a 
phenotypic effect or at least contributes to a phenotypic effect. Alternatively, expression may 
5 involve induction or promotion of transcript degradation such as during co-suppression resulting 
in inhibition, reduction or otherwise down-regulation of translatable product of a gene. In the 
latter case, the nucleic acid molecules of the present invention may result in production of 
sufficient transcript to induce or promote transcript degradation. This is particularly useful if a 
target endogenous gene is to be silenced or if the target sequence is from a pathogen such as a 
10 virus, bacterium, fungus or protozoan. In all instances "expression" is modulated but the result 
is conveniently measured as a phenotypic change resulting from increased or stabilised 
production of transcript, resulting in increased or stabilised translation product or increased or 
enhanced transcript production leading to transcript degradation such as in co-suppression 
resulting in loss of translation product. 

15 

The second mentioned nucleotide sequence is preferably an exogenous nucleotide sequence 
meaning that it is either not normally indigenous to the genome of the recipient cell or has been 
isolated from a cell's genome and then re-introduced into cells of the same plant or animal, same 
species of plant or animal or a different plant or animal. More preferably, the exogenous 
20 sequence is a transgene or a derivative thereof which includes parts, portions, fragments and 
homologues of the gene. 

The first mentioned nucleotide sequence described above is referred to herein as an "expression 
modulating sequence" (EMS) since it functions to and is capable of increasing or stabilizing 
25 expression of an exogenous nucleotide sequence such as a transgene or its derivatives. This in 
turn may have the effect of alleviating silencing of an exogenous nucleotide sequence or may 
promote transcript degradation such as via co-suppression. The latter is particularly useful as 
a defence mechanism against pathogens such as but not limited to plant viruses and animal 
pathogens. 

30 

Accordingly, another aspect of the present invention relates to an expression modulating 
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sequence (EMS) comprising a sequence of nucleotides which increases, enhances or stabilizes 
expression of a second nucleotide sequence inserted within, adjacent to or otherwise proximal 
to said EMS. 

5 The term "modulating" is used to emphasise that although transcription may be increased or 
stabilised, this may have the effect of either permitting stabilised or enhanced translation of a 
product or inducing transcription degradation such as via co-suppression. 

The EMSs of the present invention were identified, in accordance with the present invention, 
10 following transposon mutagenesis of plants with the Ds/Ac transposon system. The Ds element 
carries a reporter gene (nos:BAR) which is normally silenced upon exposure to the transposase 
gene. In a few cases, plants are detected in which nos:BAR expression is not silenced. In 
accordance with the present invention, it has been determined that the Ds element inserts within, 
adjacent to or otherwise proximal with an EMS which results in increased or stabilized 
15 expression of the nos.BAR, In other words, the EMS facilitates expression of a gene and 
preferably an exogenous gene or a transgene. This in turn may result in gene product being 
produced or induction of transcript degradation such as via co-suppression. 

The EMSs of the present invention are conveniently provided in a genetic construct. 

20 

Accordingly, another aspect of the present invention contemplates a genetic construct comprising 
an EMS as herein defined and means to facilitate insertion of a nucleotide sequence within, 
adjacent to or otherwise proximal with said EMS. 

25 The term "genetic construct" is used in its broadest sense to include any recombinant nucleic acid 
molecule and includes a vector, binary vector, recombinant virus and gene construct. 

The means to facilitate insertion of a nucleotide sequence include but are not limited to one or 
more restriction endonuclease sites, homologous recombination, transposon insertion, random 
30 insertion and primer and site-directed insertion mutagenesis. Preferably, however, the means is 
one or more restriction endonuclease sites. In the case of the latter, the nucleic acid molecule 
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is cleaved and another nucleotide sequence ligated into the cleaved nucleic acid molecule. 

Preferably, the inserted nucleotide sequence is operably linked to a promoter in the genetic 
construct. 

5 

According to this embodiment, there is provided a genetic construct comprising an EMS as 
herein defined and means to facilitate insertion of a nucleotide sequence within, adjacent to or 
otherwise proximal with said EMS and operably linked to a promoter. 

10 Conveniently, the genetic construct may be a transposable element such as but not limited to a 
modified form of Ds. A modified form of Ds includes a Ds molecule comprising an EMS and 
a nucleotide sequence such as but not limited to a reporter gene, a gene conferring a particular 
trait on a plant cell or a plant regenerated from said cell or a gene which will promote co- 
suppression of an endogenous gene. 

15 

Another aspect of the present invention contemplates a method of increasing or stabilising 
expression of a nucleotide sequence or otherwise preventing or reducing silencing of a nucleotide 
sequence or promoting transcription degradation of an endogenous gene in a plant or animal or 
cells of a plant or animal, said method comprising introducing into said plant or animal or plant 
20 or animal cells said nucleotide sequence flanked by, adjacent to or otherwise proximal with an 
EMS. 

In an alternative embodiment, there is provided a method of inhibiting, reducing or otherwise 
down-regulating expression of a nucleotide sequence in a plant or animal or cells of a plant or 
25 animal, said method comprising introducing into said plant or animal or plant or animal cells the 
nucleotide sequence flanked by, adjacent to or otherwise proximal with an EMS. 

Yet another aspect of the present invention provides a transgenic plant or animal carrying a 
nucleotide sequence flanked by, adjacent to or otherwise proximal to an EMS. As a 
30 consequence of the EMS, the expression of the exogenous nucleotide sequence is increased or 
stabilised resulting in expression of a phenotype or loss of a phenotype. 
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Although not intending to limit the present invention to any one theory or mode of action, the 
EMS is proposed to comprise a methylation resistance sequence. 

According to this aspect, the DNA methylation resistant sequence may prevent inhibition of 
5 transcription or delay mRNA transcript turnover. This can enhance, increase or stabilise an 
transcript and translation into a gene product or may induce or promote transcript degradation 
such as via co-suppression. 

The present invention further provides for an improved transposon tagging system. 

10 

One system employs a modified Ds element which now carries an EMS. 

Accordingly, another aspect of the present invention is directed to an improved transposon 
tagging system, said system comprising a transposable element carrying a nucleotide sequence 
15 flanked by, adjacent to or otherwise proximal with an EMS. 

Another new system employs the Dem gene or its derivatives as an excision marker. Reference 
to "derivatives" include reference to mutants, parts, fragments and homologues of Dem including 
functional equivalents. The Dem gene is required for cotyledon development and shoot and root 
20 meristem function. Stable Ds insertion mutants of Dem germinate but fail to develop any further. 
However, unstable mutants in the Dem locus result in excision of the Ds element and reversion 
of the Dem locus to wild-type, thereby restoring function to the shoot meristem. In accordance 
with the present invention, the new system enables selection for transposition. 

25 In accordance with the improved method, transposition is initiated by crossing a Ds line with a 
stabilized Ac (sAc) line. The Ds line is heterozygous for a Ds insertion in the Dem gene and the 
sAc line is heterozygous for a stable mutation in the Dem gene. A particularly useful mutant in 
the Dem gene is a frameshift mutation. Both of the Ds and sAc containing plant lines are wild- 
type due to the recessive nature of the Ds insertion and mutant alleles. The F, progeny derived 

30 from crossing the Ds and sAc lines segregate at a ratio of 3 wild-types to 1 mutant. Because the 
sAc is linked to the frameshift dem allele, almost all of the F, mutants also inherit the transposase 
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gene and can undergo somatic reversion. These revertant individuals have abnormal 
cotyledons, but Ds excision from the Dem gene restores function to the shoot apical meristem. 
Each somatic revertant represents an independent transposition event from the Dem locus. By 
screening for expression of a gene resident on the Ds element (e.g. nos. BAR), the identification 
5 of EMSs is readily determined. 

The present invention also provides in vivo bioassays for expressed transgenes. The bioassays 
identify nucleotide sequences which prevent transgene silencing. 

10 In one aspect, the plant expression vector pZorz (see Figure 5) carries a firefly luciferase reporter 
gene (luc), under the control of the Osa promoter (12). After bombardment, the gene is 
expressed in embryogenic sugarcane callus. However; it becomes completely silenced upon plant 
regeneration. The silencing appears to be correlated with methylation of the transgene. Genetic 
sequences flanking reactivated nos. BAR insertions are inserted in the pZorz vector at the Hindlll 

15 site upstream from the Osa promoter. These modified pZorz constructs are then used with a 
transformation marker to transform sugarcane in order to test whether the plant sequences are 
capable of alleviating silencing of the luc gene upon plant regeneration. Restriction endonuclease 
fragments capable of alleviating silencing of the luc gene are subcloned by deletion analysis into 
smaller fragments to define the sequence more accurately. 

20 

In another aspect, a plant expression vector is constructed for testing the EMSs in 
Agrobacterium-tiansfonved Arabidopsis. EMSs are placed upstream of the nosduc or nos:gus 
gene linked to a transformation marker and used to test whether EMSs stabilise expression of 
the nosduc or nos:gus gene in Arabidopsis. 

25 

The present invention further described by the following non-limiting Examples. 
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EXAMPLE 1 
Ds/sAc Transposon system 

The inventors have previously developed a two component Ds/sAc transposon system in 
5 transgenic tomato for tagging and cloning important genes from plants (5, 13). The 
components of the system are shown in Figure 1 and comprise: i) a non-autonomous 
genetically-engineered Ds element (e.g. SLJ1561), and ii) an unlinked transposase gene sAc 
(SLJ10512), required for transposition of the Ds element. To activate transposition, the two 
components are combined by crossing transformants for each component. A plant selectable 

10 marker gene, e.g. nos:BAR, is inserted into the Ds element to enable selection for reinsertion 
of the elements following excision from the T-DNA (Figure 1). Surprisingly, the marker gene 
is irreversibly inactivated when the Ds line is crossed to a transformant expressing the 
transposase gene (5). Silencing occurred when the Ds element remained in the T-DNA, and 
also occurred in the great majority of cases when the Ds element transposed to a new location 

15 in the tomato genome. None of the other marker genes in the T-DNA is silenced. The silenced 
marker gene has been shown to be stably inherited, even after the transposase gene segregates 
away from the Ds element in subsequent generations. 

EXAMPLE 2 

20 Transposon tagging of a chromosomal region enabling 

full expression of the nos.BAR transgene 

The experimental strategy for generating tomato lines carrying transposed Ds elements from 
T-DNA 1561E is shown in Figure 2. The Ds element in 1561E carries a nos. BAR marker gene. 

25 In construction of the Ds, the 5' end of the nos promoter is cloned into the Xho I site, 1 100 bp 
from the 3' end of Ac. As a strategy to tag regions of the tomato genome associated with high 
level gene expression, hundreds of plants carrying transposed Ds elements are screened for 
resistance to phosphinothricin (PPT), the selection agent for the BAR gene. Several lines are 
identified which show at least some level of resistance. One line, called UQ406, carries a single 

30 transposed Ds element (without the transposase gene which has segregated away) and is 
resistant to PPT (Figure 3). Stable inheritance of BAR gene expression in this line has been 
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demonstrated through several generations. These results indicate that the strategy for tagging 
active chromosomal regions by screening for PPT resistance is a successful approach. Southern 
hybridization analysis of the original Ds transformant 156 IE, UQ406 and several lines carrying 
silenced nos: BAR transgenes indicates that silencing is correlated with methylation of the Sstll 
5 site in the nos promoter (Figure 4). Total leaf tissue is used in this analysis, and the SstTL site 
in the nos promoter in UQ406 is partially methylated. In silent nos.BAR genes, a NotI site 
immediately downstream from the coding sequence is also methylated (Figure 4). In UQ406, 
the NotI site is unmethylated, as in 156 IE (Figure 4). 

10 EXAMPLE 3 

Cloning sequences flanking active nos.BAR genes 

GenomeWalker (14) is used to clone the tomato DNA sequences flanking the Ds element in 
UQ406. The DNA flanking the Ds element in line UQ406 is cloned and sequenced, and a 

15 search of the PROSITE database reveals that the Ds has inserted into the promoter region of 
an ot-amylase gene. The promoter [SEQ ID NO:l] shows strong homology to an ot-amylase 
promoter of potato (15; Figure 5) [SEQ ID NO:2] and the coding sequence of the gene has 
strong homology with one of 3 reported potato ot-amylase cDNAs (16). The DNA from 651 bp 
upstream of the UQ406 insertion to the end of the Dem coding sequence, has been sequenced 

20 (Figure 6) [SEQ ID NO:3]. 

EXAMPLE 4 

An improved transposon tagging strategy for transgenic tomato 

25 The inventors have used the transposon tagging system described in Example 1 (also see Figure 
2) to tag and clone two important genes involved in shoot morphogenesis. The DCL gene is 
required for chloroplast development and palisade cell morphogenesis (13) and the Dem 
(Defective Embryo Mer stem) gene is required for cotyledon development and shoot and root 
meristem function. Stable Ds insertion mutants of Dem germinate but fail to develop any further 

30 (Figure 7). Figure 7 also shows an example of an unstable mutant of the Dem locus. Upon 
germination, these variegated seedlings appear at first to be mutant. However, the transposase 
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gene activates transposition of the Ds and reversion of the Dem locus to wild-type, thereby 
restoring function to the shoot meristem. 

While the transposon tagging system described in Figure 2 has been successful in tagging genes 
5 and a chromosomal region alleviating transgene silencing, it does have two associated 
inefficiencies. First, transposition cannot be selected in the shoot meristem of V x plants 
heterozygous for Ds and sAc. As a consequence, many TC 1 progeny derived from test-crossing 
these F, plants still have the Ds located in the T-DNA. The other limitation of the system is that 
sibling TC, progeny derived from a single F { plant often carry the same clonal transposition and 
10 reinsertion event. The extent of clonal events amongst sibling TC! progeny can only be 
monitored by time consuming and expensive Southern hybridisation analysis. 

These two inefficiencies in the transposon tagging strategy are overcome in accordance with the 
present invention by using the Dem gene as an excision marker. The new system enables 

15 selection for transposition in the shoot apical meristem and visual identification of plants carrying 
independent transposition events. Transposition is initiated by crossing a Ds line with a sAc line 
(Figure 8). The Ds line is heterozygous for a Ds insertion in the Dem gene and the sAc line is 
heterozygous for a stable frameshift mutation in the Dem gene (Figure 8). The frameshift allele 
is derived from a Ds excision event from the Dem locus. Both the Ds and sAc lines are wild-type 

20 due to the recessive nature of the Ds insertion and frameshift alleles. PCR tests on intact leaf 
tissue have been developed for the rapid identification of these Ds and sAc parental lines. The 
Fi progeny derived from crossing the Ds and sAc lines segregate at the expected ratio of 3 wild- 
types to 1 mutant. Because the sAc is linked to the frameshift dem allele, almost all of the Fj 
mutants also inherit the transposase gene (sAc) and can undergo somatic reversion. These 

25 revertant individuals have abnormal cotyledons, but Ds excision from the Dem gene restores 
function to the shoot apical meristem (see Figure 7). Each somatic revertant represents an 
independent transposition event from the Dem locus. A non-destructive test for nos.BAR 
expression is used involving application of PPT (the selective agent for expression of BAR gene) 
to a small area of a leaf. Somatic revertants resistant to PPT are grown though to seed and the 

30 F 2 progeny are screened again for PPT resistance. Lines carrying transposed Ds elements 
expressing nos.BAR are selected for more detailed molecular analysis. Three independent 
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insertions (UQ1 1, UQ12 and UQ14) carry active nos.BAR genes. The donor Ds was originally 
located in the Dem gene (Figure 4) and in that location in the Dem gene the nos. BAR gene was 
silent. 

5 The efficient saturation mutagenesis of this chromosomal region is dependent on the use of the 
Dem gene as a selectable marker for independent transposition events. A recombinant selectable 
marker for independent transpositions is produced and transformed into tomato for saturation 
mutagenesis in other chromosomal regions of tomato. This system may be introduced into any 
species possessing the dem mutation, in order to facilitate transposon tagging of genes. 

10 

EXAMPLE 5 

A rapid bioassay for identification of tomato DNA sequences 
capable of alleviating transgene silencing in a heterologous plant species 

15 An efficient transformation system has been developed for sugarcane, based on particle 
bombardment of embryogenic alleles, followed by plant regeneration (17). The bioassay is useful 
for identifying tomato sequences which prevent transgene silencing and employs the plant 
expression vector pZorz (Figure 9). This plasmid carries a firefly luciferase reporter gene (luc), 
under the control of the Osa promoter (12). After bombardment of embyrogenic callus of sugar 

20 cane, the luciferase gene is expressed as observed by visualisation of the chemiluminescence of 
the luciferase enzyme. However, it becomes completely silenced upon plant regeneration in 
normal sugar cane. This is used to test the system. The silencing appears to be correlated with 
methylation of the transgene. Tomato sequences flanking reactivated nos. BAR insertions are 
inserted in the pZorz vector at the Hindlll site upstream from the Osa promoter (Figure 10). 

25 These modified pZorz constructs are then used with a transformation marker to transform 
sugarcane in order to test whether the tomato sequences are capable of alleviating silencing of 
the luc gene. They are then subcloned by deletion analysis into smaller fragments to more 
accurately define the sequences. 

30 Tomato sequences flanking reactivated nos.BAR insertions are also introduced next to a 
nos. BAR, nos. LUC or nos. GUS recombinant gene in another plasmid vector. These modified 
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recombinant BAR, LUC and GUS genes are inserted into binary vectors (4) for transformation 
into Arabidopsis thaliana (18) to test the ability to prevent silencing of the nos:BAR gene in 
Arabidopsis. 

5 EXAMPLE 6 

Analysis of sequences responsible for reactivating nos:BAR expression 

The borders of DNA elements that prevent transgene silencing are initially defined by deletion 
analysis of clones that yield positive results in the bioassays. The smallest active clone for each 
10 chromosomal region is then sequenced and characterised in detail. Sequences from independent 
Ds insertions are compared for homologous DNA elements. 

Those skilled in the art will appreciate that the invention described herein is susceptible to 
variations and modifications other than those specifically described. It is to be understood that 
15 the invention includes all such variations and modifications. The invention also includes all of the 
steps, features, compositions and compounds referred to or indicated in this specification, 
individually or collectively, and any and all combinations of any two or more of said steps or 
features. 

20 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



TTTGAAATTT 


ATGTATTTAT 


CTATAGCATT 


AGAAACTATA 


AGAGTTGTTA 


GCTTCACTT 3 


60 


GCTTACTGTT 


GTGCTCAAAG 


CAACTTCATC 


ATCATACAGT 


ATGGTTTTGA 


TATGCTCTTC 


120 


CATTATCACT 


GAGCCTTATG 


ATTATGTTTT 


ACGAGCTTAT 


AATATCACTG 


ATGGTGATTC 


180 


AGTATTGTGA 


TTATGTCCTT 


CGTTGATTAT 


TCTGTTTCAT 


ACAAGTCGTG 


TAATTTGCTG 


240 


TTTGTGACAG 


TACGATAGAT 


CGACTCAACC 


TTC TGAGGTA 


TTAGTTGAAG 


TTCATGTAAA 


300 


TTAGCTTTGT 


TTATCATAGT 


AGCATTTGAT 


TATTGATGCT 


CTGTAGCTAA 


TGATAAGCCA 


360 


TTGGAGGGAA 


GCAAGCTTTC 


TAAATGAATC 


TACGAATGGA 


TGATAAAGTT 


CATGAATATT 


420 


TTTfiTTACTT 


CTGCAGTCAG 


ATCATGAGTT 


ATTGAGTCTA 


TTGTTTTTTT 


AAGCCTGTTT 


480 


CAGATGATCC 


ATCATCAGTA 


ACAACATACA 


CGGTGTAGTC 


CCAAATCCAT 


C ATATGC AC C 


540 


TTCTTTTCTT 


CAATTTGGTC 


TTGTTTTTTT 


TTTTTCATGA 


TGTCATTGAA 


TTATTCAAGA 


600 


AGTCACTTCG 


AGCATAATGA 


TTTTTCAAAA 


TCCACCTTTG 


TTCAAGCACT 


ACCACGTCTT 


660 


TTCATCTAGC 


CCACAACCGT 


GGTGGAGGAT 


C T AG AATTTT 


CATGAAAGGA 


TTCAAAATTT 


720 


ACAAACATAT 


ATATACACTA 


TACACTATGA 


ATCCACTAAT 


ACTAGATGGT 


GCACCTGTGC 


780 


CCCCACTCAT 


GTGAAAGCCT 


ATTCTCAATT 


TTTTATTTTC 


CACAACTTAA 


ATACAGACCG 


840 


CACAACTCCC 


GTGTCTTGTG 


TGCTCGTCGC 


TCAGCATGCA 


AGTCGAGAAA 


AGAAAGACCA 


900 


AAACAATGAA 


AACTTTACGA 


AAAATCAAAA 


AGTTGAAGGA 


CTTTAACGTC 


GAGATCTCTC 


960 


GTAGAAAACC 


TCTTTTGTAA 


GGTTGCATAC 


AATACTTTTT 


TTTCAGACTT 


T AC TTATGGT 


1020 


ATTATACTGA 


ATATGTTATT 


GCTGTTATAG 


TAGTTGAGTG 


ACGTTTGAGG 


G AATTTC TAG 


1080 


TCCGTTAATC 


TTGTACTCAG 


TGTGTCTACT 


TTTCAAAAAA 


GTCAGTTTTT 


CAGTCTCTAA 


1140 


AACACATTTA 


AATAAGAGTT 


TCTTTGCCCA 


TCTTTTGTTC 


CTCATCCTAG 


GCTTGGAGTC 


1200 


AACACAACAC 


AACAACA 










1217 
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{2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



TTTGAAATTT 


ATGTA^ATAT 


CTGTAGCATT 


AGAAACTATA 


AGAGTTGTTA 


GCTTCACTTG 


TCTTATTGTT 


GTGCTCAAAG 


CAACTTCATC 


ATACAGTATG 


GTTTTTATAT 


GCTCTTCCAT 


TATCACCGAA 


CCTTATGATT 


ATGTGTACGA 


GCTTATAATA 


TTACTGATGG 


TGATTCAGTA 


TTATGATTAT 


GTCCTCCATT 


AATTATTCTG 


TTTCATACAA 


GTCGTGTAAT 


TTGCTGTTTG 


TGATTGTACG 


ATAAATTGAT 


TCAACCTTCT 


GCGGTGTTGG 


TTGAAGTTCA 


AGTAAATTAG 


CTTTATTTAT 


CATAGTAGCA 


TTTGATTATT 


GATGCTCTGT 


AGCTAATGAT 


AAGCCATTGA 


AGGGAAGCAG 


AAATGGTAAA 


GCTTTCTAAA 


ATGAATCTAC 


GAATGGATGA 


TAAAGTTAAT 


GAATATTGTT 


GATACTTCTG 


CAATCAGATT 


ATGAGTTACT 


GAGTCTACTG 


TTTTTTAAGC 


CTGTTTCAGA 


TGATCGATCA 


TCAACAACAA 


CATATTCAGT 


GTAGTAGACA 


TGATCGATCA 


CTTTCTAATT 


TTCGATTATG 


CACCCTCTTT 


TCTCCAATTT 


GGTCGTCTTC 


TTTTTTTCAT 


GATGTCACTG 


AATTATTCTC 


TGGTCGTCCC 


CACCATTCAG 


GAAGTCACTT 


CGAGCATAAT 


GTGAAAACAT 


CCACATTTTT 


CAAATCCAGC 


AGAATTTTCA 


TCAAACGGGG 


TTCAACATTT 


ACTACATGTA 


TACACTCTGA 


AGTCTGAATC 


CACTAATTCT 


AGATGGTGCA 


TCTGTGCCCC 


CACACTTGTG 


AAAGCTTATT 


CTCAATTTTT 


TATTTTCCAA 


CAACTTGAAT 


TCAGACCACA 


CAACTCCCGT 


GTCTTGTACG 


GTCAGCATCT 


GAGTGGAGAA 


CTCAATTAAG 


TGACTTTAAC 


GTCGAGTTCT 


ATAGTAAACA 


ACCCCTATAT 


CTTTTTTCAA 


GCATGTTAAG 


ATTGCGAACA 


CACTGAAATT 


TCCAGGTCGT 


TAATCTTGTA 


CCCAGTGTGT 


GTACTTTTAA 


AAAAAAAAGT 


CAGTTTTTTA 


GTCTCTAAAA 


CACATTTAAA 


TAGAGTTTAT 


TTGCCATCTT 


TTGTTCCTCA 


TACTAGACTT 


CGGAGTCAAC 


ACAACACAAC 


AACA 







(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 




240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



840 
900 




960 
1020 
1080 
1114 
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CGACGGCCCG 


GGC TGGTAAA 


TGCGGAAGCT 


TGTTACAGAT 


TTGAAATTTA 


TGTATTTATC 


60 


TATAGCATTA 


GAAACTATAA 


GAGTTGTTAG 


CTTCACTTGG 


CTTACTGTTG 


TGCTCAAAGC 


120 


AACTTCATCA 


TCATACAGTA 


TGGTTTTGAT 


ATGCTCTTCC 


ATTATCACTG 


AGCCTTATGA 


180 


TTATGTTTTA 


CGAGCTTATA 


ATATC AC TG A 


TGGTGATTCA 


GTATTGTGAT 


TATGTCCTTC 


240 


GTTGATTATT 


CTGTTTCATA 


CAAGTCGTGT 


AATTTGCTGT 


TTGTGACAGT 


ACGATAGATC 


300 


GACTCAACCT 


TCTGAGGTAT 


TAGTTGAAGT 


TCATGTAAAT 


TAGCTTTGTT 


TATCATAGTA 


360 


GCATTTGATT 


ATTGATGCTC 


TGTAGCTAAT 


G AT AAGC CAT 


TGGAGGGAAG 


CAAGCTTTCT 


420 


AAATGAATCT 


ACGAATGGAT 


GATAAAGTTC 


ATGAATATTT 


TTGTTACTTC 


TGCAGTCAGA 


480 


TCATGAGTTA 


TTGAGTCTAT 


TGTTTTTTTA 


AGCCTGTTTC 


AGATGATCCA 


TCATCAGTAA 


540 


CAACATACAC 


GGTGTAGTCC 


CAAATCCATC 


ATATGCACCT 


TC TTTTC TTC 


AATTTGGTCT 


600 


TGTTTTTTTT 


TTTTCATGAT 


GTCATTGAAT 


TATTCAAGAA 


GTCACTTCGA 


GCATAATGAT 


660 


TTTTCAAAAT 


CCACCTTTGT 


TCAAGCACTA 


CCACGTCTTT 


TCATCTAGCC 


CACAACCGTG 


720 


GTGGAGGATC 


TAGAATTTTC 


ATGAAAGGAT 


TCAAAATTTA 


CAAACATATA 


TATACACTAT 


780 


ACACTATGAA 


TCCACTAATA 


C TAG ATGG TG 


CACCTGTGCC 


CCCACTCATG 


TGAAAGCCTA 


840 


TTCTCAATTT 


TTTATTTTCC 


AC AAC TTAAA 


TACAGACCGC 


ACAACTCCCG 


TGTCTTGTGT 


900 


GCTCGTCGCT 


CAGCATGCAA 


GTCGAGAAAA 


GAAAGACCAA 


AACAATGAAA 


ACTTTACGAA 


960 


AAATCAAAAA 


GTTGAAGGAC 


TTTAACGTCG 


AGATCTCTCG 


TAG AAAAC C T 


CTTTTGTAAG 


1020 


GTTGCATACA 


ATACTTTTTT 


TTC AG AC TTT 


ACTTATGGTA 


TT AT AC TG AA 


TATGTTATTG 


1080 


CTGTTATAGT 


AGTTGAGTGA 


CGTTTGAGGG 


AATTTCTAGT 


CCGTTAATCT 


TGTACTCAGT 


1140 


GTGTCTACTT 


TTCAAAAAAG 


TCAGTTTTTC 


AGTCTCTAAA 


ACACATTTAA 


ATAAGAGTTT 


1200 


CTTTGCCCAT 


CTTTTGTTCC 


TCATCCTAGG 


CTTGGAGTCA 


ACACAACACA 


ACAACAATGA 


1260 


ATTTCCATTT 


TTCTGTTTCT 


TT AC TTC TC T 


CTTTATCTCT 


TCCTATGTTT 


GCCTCTTCGA 


1320 


CGGTGTTATT 


TCAGGTATCC 


ATCTCCAAAG 


AACCTTATTT 


TTC TC TT AAC 


TTTTC CTATG 


1380 


TATATGTATC 


TCTATGTTTA 


TGTAGTACTT 


GCTCAAGTAT 


ATAAAGAAAA 


GTTAGTTTCT 


1440 


CTAGAATCTT 


TGAATTCATT 


TGTTAGGGGT 


TCAATTGGGA 


TTCGAGTAAT 


AAGCAAGGCG 


1500 


GATGGTACAA 


CTCTCTCATC 


AAC TTAGTTC 


CGGACTTGGC 


TAAAGCTGGA 


GTTACTCATG 


1560 


TTTGGTTGCC 


ACCATCATCT 


CACTCCGTTT 


CTCCTCAAGG 


TAATTTTCGG 


AGTGATTGTG 


1620 


ACCTAGTAAT 


CCAATGAAGT 


CAAAATAACC 


ACGGAAGATT 


AGAGTCTAAA 


TTTTAATGAA 


1680 


AATAGTTCAG 


ACAAGTTAAT 


GACCAACTTA 


TATATTAGTT 


CAATCCATAA 


AATTTGATGT 


1740 


AGTAGTTACA 


AAATGGAATT 


GCTTGAAGGC 


TTATGCCATG 


TTTTATGCCA 


GGTTATATGC 


1800 


CAGGAAGGTT 


GTATG AC TAG 


GATGCTTCCA 


AGTTTGGAAA 


TCAGCAACAA 


C TG AAAAC TC 


1860 


TTATTAAGGC 


TTTAACATGA 


CCACGGGATC 


AAATCGGTTG 


CTGATATAGT 


GATAAATCAT 


1920 


AGAACTGCTG 


ATAACAAAGA 


TAGCAGGGGA 


ATATACAGCA 


TCTTTGAAGG 


AGGAACATCT 


1980 


GATGACCGGC 


TTGATTGGGG 


TCCATCTTTC 


ATTTGCAGGA 


ACGACACACA 


ATATTCTGAT 


2040 
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GGCACGGGGA 


ATCCAGACAC 


GGGTTTGGAC 


TTTGAACCTG 


CACCTGATAT 


CGATCATCTT 


2100 


. AATACGAGAG 


TGCAGAAAGA 


GTTATCAGAC 


TGGATGAACT 


GGCTGAAATC 


TGAAATTGGA 


2160 


TTTGATGGTT 


GGCGTTTCGA 


TTTTGTTAGG 


GGATATGCAC 


CTTGCATTAC 


CAAAATTTAT 


2220 


ATGGGAAACA 


CGTCCCCGGA 


TTTTGCTGTT 


GGTGAATTGT 


GGAACTCTCT 


TGCTTATGGC 


2280 


CAGGACGGGA 


AACCGGAATA 


TAACCAGGAC 


AATCATAGAA 


ATGAGCTAGT 


TGGTTGGGTA 


2340 


AAAAATGCGG 


GGCGGGCTGT 


AACAGCTTTT 


GATTTTACAA 


CAAAGGGAAT 


TCTTCAAGCT 


2400 


GCAGTTCAAG 


AAGAGTTATG 


GAGATTGAAG 


GATCCCAATG 


GAAAACCTCC 


TGGGATGATC 


2460 


GGTGTTTTGC 


CTCGAAAAGC 


TGTGAC TTTT 


ATCGATAATC 


ATGATACTGG 


ATCGACACAA 


2520 


AATATGTGGC 


CTTTCCCTTC 


AGACAAAGTT 


ATGCAAGGAT 


ATGCATACAT 


TCTTACTCAT 


2580 


CCAGGAATCC 


CATCCGTGGT 


AAAAAAAATA 


AATAAATTCT 


TTCTACATAT 


CTCATTGTTT 


2640 


TCTATTTTAC 


AAGAAATTTA 


TATTCTTTTC 


CAGGGGATTT 


GAGAAACTCG 


GCCTGTGGGA 


2700 


GTTTGCTCAC 


ATTGCCAGTC 


TCGTAATCCA 


TAAACAAACA 


CTCAAACTCT 


GAGTGTGCAC 


2760 


ATCTAGACAC 


CTCAACTCGT 


TTTTCACCGT 


GTTAATTGAA 


CACTTCAACT 


TACAAAATGA 


2820 


TCGTGTAGCA 


CCTCCAAAAA 


TTATGTGTCA 


CAATTAGCCA 


CGTGCGAGAT 


ACACGAAAAT 


2880 


GAGTTGGAGT 


AGTTAGTTGC 


CAAATAAAAC 


CAAGCTGAGG 


TGTCTAAATG 


TGCACNCTCA 


2940 


AAGTNGGATG 


TTTACTTGGC 


AGCTGAGGCC 


GAGGCCATGT 


TTGANTGTTA 


TGCTTATAGG 


3000 


ATATGACACA 


TTTGTTTCCG 


ATTAGCTGAG 


GANTTGATTA 


AATCCTNGTT 


TTNGTTNGCA 


3060 


GTTTNATNAC 


CATTNCTTTG 


ATNGGGGCTN 


CNAGGATGGA 


ATTNCAGCAC 


TAANCTCTAT 


3120 


TAGGAAAAGG 


AATAGGATTT 


GTGCANCAAG 


CAATGTGCAA 


ATAATGGCTC 


CTGATTCTGA 


3180 


ATCTTTATAT 


ANCAATGGAT 


CATCACAAAA 


TCATTGTCAA 


GATTGGACCA 


AAACTTGATC 


3240 


TTGGAAATCT 


TATTCCACCT 


AATTATGAGG 


TGGCAACTTC 


TGGACAAGAC 


TATGCTGTAT 


3300 


GGGAGCAAAA 


GGCATAATCA 


TATTGTAC C A 


CACTAAAAGG 


GACCATGGCC 


ACAATGGTTC 


3360 


TCATTAGTGT 


TAATGTTATA 


TGATTGAAAA 


TGTAATTTAT 


ATTGACATAA 


TGAAGGCCAA 


3420 


AAATTCAAGA 


AATTATAAAC 


AATTCAATAG 


TCCTTGCTCA 


ATTCACAATT 


AC ATT ATG AC 


3480 


TTCTCTATTG 


CAAACTAGTT 


TGGGTCCACA 


TTATTGTCTC 


CTAAAATTTT 


ACAACATTTC 


3540 


TTAAGGGAAC 


TTAATTAGTT 


ACAGTGAACA 


TATGTTGAAA 


TTACCCTTTA 


TCCCCTTACA 


3600 


ATTGATTTAA 


TAAATATTTC 


CCCTATCCCT 


TTGGTAGTTG 


GTTAGAGTTA 


TAAGTAACGT 


3660 


AGAGATTAGT 


TATAAGAGAA 


TTTATGTATT 


ATTATGCAGA 


TGTTTAGTTA 


TATCGATTTT 


3720 


AGTTATTTAT 


ATGTTGATTA 


TTTCACCTTC 


AATAATGCAT 


ATAAAGATGG 


TAAATGATTG 


3780 


GATTGATCGA 


ATTCGAATGA 


GTTTGAATAT 


GAACTAATCT 


TCAAATTTAA 


TATAAATTTT 


3840 


TTTTGTCAAC 


ATCTATAGCC 


AAACGGCTCC 


AAAACAATAA 


ATAATTTACA 


TTTATTGTAG 


3900 


TATTTTATTT 


AAAATGGGAT 


NTTCCTCATC 


CCACTTGTAC 


CAGTTGAAAC 


CCTAATAATA 


3960 


AGCCAATCCA 


ACCGTCAAAA 


TTACAAATTT 


TGAAAATTGC 


GCTCCTCACA 


GTTCTCCCCT 


4020 


ATTCAGATTT 


GATTCATTCT 


CTTCATTTTT 


TGTTTTCACA 


TTTTACCTCT 


AAATCAACTC 


4080 
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GAGTCCCTTT 


GTTCAAATGG 


GTGCTAATCA 


CAGCCGTGAA 


GATCTGGAGC 


TTTCTGATTC 


4140 


CGAGTCTGAA 


TCCGAATATG 


GGTCCGAGTC 


TCGAACAAGG 


GAGGAAGAGG 


AAGACGAAGA 


4200 


TAACTACTCA 


GATGCTAAAA 


CGACGCCGTC 


TTCCACTGAT 


CGGAAACAGA 


GCAAAACCCC 


4260 


GTCTTCTTTG 


GATGATGTTG 


AAGCAAAGCT 


GAAAGCTTTA 


AAGCTTAAGT 


ATGGTACTCC 


4320 


TCATGCTAAA 


ACCCCCACAG 


CGAAAAACGC 


TGTTAAACTT 


TACCTTCATG 


TTGGTGGGAA 


4380 


CACTGCGAAT 


TCCAAATGGG 


TAGTTTCTGA 


TAAGGTGACA 


GCTTATTCGT 


TTGTTAAATC 


4440 


GGGTAGTGAG 


GATGGATCGG 


ATGATGATGA 


AAATGAAGAA 


ACTGAGGAGA 


ATGCTTGGTG 


4500 


GGTTTTGAAA 


ATTGGGTCGA 


AGGTTCGGGC 


TAAGATTGAT 


GAGAATTTGC 


AGCTCAAGGC 


4560 


ATTTAAGGAG 


CAGAAAAGGG 


TGGATTTTGT 


GGCGAATGGG 


GTTTGGGCTG 


TGAGATTCTT 


4620 


TGGGGAGGAA 


GAGTATAAGG 


CGTTCATTGA 


C TTATATC AG 


AGCTGTTTGT 


TTGAGAATAC 


4680 


TTATGGGTTT 


GAGGCAAATG 


ATGAGAATAG 


AGTTAAGGTG 


TATGGTAAAG 


ACTTTATGGG 


4740 


GTGGGCAAAT 


CCAGAAGCTG 


CGGATGATTC 


AATGTGGGAG 


GATGCTGGGG 


ATAGCTTCGC 


4800 


GAAGAGCCCT 


GCGTCTGAAA 


AGAAGACACC 


TTTGAGGGTT 


AACCATGATT 


TGAGGGAGGA 


4860 


GTTTGAGGAG 


GCAGCTAAAG 


GAGGAGCTAT 


TCAGAGCTTG 


GCATTAGGTG 


CGTTGGATAA 


4920 


TAGTTTTCTT 


ATAAGTGATT 


CTGGAATTCA 


GGTTGTGAGG 


AAC TAT AC TC 


ATGGAATAAG 


4980 


TGGAAAAGGT 


GTTTGTGTCA 


ATTTTGATAA 


GGAAAGGTCT 


GCTGTACCTA 


ATTCCACTCC 


5040 


AAGGAAAGCT 


CTACTTCTAA 


GAGCTGAGAC 


TAATATGCTT 


CTCATGAGTC 


CAGTGACTGA 


5100 


TAGAAAGCCT 


CACTCTCGGG 


GATTACATCA 


GTTTGATATC 


GAGACTGGGA 


AGGTTGTTAG 


5160 


CGAGTGGAAG 


TTTGAGAAAG 


ATGGAACTGA 


TATCACGATG 


AGGGATATCA 


C T AATG AT AG 


5220 


CAAAGGAGCT 


CAGATGGATC 


CTTCGGGGTC 


TACTTTCTTA 


GGGCTAGATG 


ATAACAGATT 


5280 


GTGTAGGTGG 


GATATGCGTG 


ATCGGCATGG 


GATGGTC C AG 


AATC TAGTTG 


ATGAAAGTAC 


5340 


TCCTGTGCTG 


AATTGGACTC 


AAGGACATCA 


ATTTTCGAGG 


GGAACTAACT 


TTCAGTGCTT 


5400 


TGCTACTACT 


GGTGATGGAT 


CAATTGTTGT 


TGGTTCACTT 


GATGGCAAGA 


TTAGATTGTA 


5460 


CTCAAGCAGT 


TCCATGAGAC 


AGGCTAAAAC 


TGCTTTTCCA 


GGCCTTGGTT 


CTCCTATCAC 


5520 


TCATGTGGAT 


GTTACCTATG 


ATGGGAAGTG 


GATATTGGGG 


AC AAC TG AT A 


CTTACTTGAT 


5580 


ATTGATATGC 


ACCTTGTTTA 


TCGACAAGAA 


TGGAACTACT 


AAGACTGGTT 


TTGCTGGTCG 


5640 


CATGGGAAAT 


AAGATTTCCG 


CTCCAAGATT 


GTTAAAGCTA 


AACCCTCTCG 


ATTCACATAT 


5700 


GGCTGGAGCT 


AACAAGTTCC 


GCAGTGCTCA 


ATTTTCATGG 


GTCACCGAGA 


ATGGGAAGCA 


5760 


AGAGCGCCAC 


CTCGTTGCTA 


CTGTTGGGAA 


GTTTAGTGTG 


ATCTGGAATT 


TTCAACAGGT 


5820 


GAAGGATGGT 


TCTCATGAGT 


GTTACCAGAA 


TCAGGTTGGG 


TTGAAGAGCT 


GCTATTGTTA 


5880 


CAAGATAGTC 


CTAAGAGACG 


ACTCTATTGT 


AGAAAGTCGT 


TTCATGCATG 


ACAAGTACGC 


5940 


TGTTTCTGAC 


TCACCTGAAG 


CACCACTGGC 


GGTAGCAACC 


CCCATGAAAG 


TCAGCTCATT 


6000 


CAGCATCTCT 


AGCAGGCGCT 


TACAAATTTG 


AACAATCATT 


CTGTTCATAT 


ACGCAACTTA 


6060 


TTAGATTTAT 


CTGTAGCAGA 


ATTAGTGTCT 


C TC AC AC T AA 


GTAGCTTGAA 


AAACTGCACA 


6120 
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TCTGCAAATC ATTTCCAGTT CAATGTATTA CTACTTTAGT TTAAAAACCT TAAAAGGCAG 
TCTTCCAAAT TCTAGGTATC CTCACCTGAC ATTATTATTG TTGTAATAGC TAATTGTTGC 
TTGCTCTAAA TCCCCGTTCA ATG 
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SLJ1561 Ds element 




BAR probe NWWNN 
3 , Ss 

1 : 1 


lys-2 probe IWWWI 
Bs Nt Rl 

: ^- 


5' 


noo.BAR 


lys-2 


1 



Sst II + BstYI digest, BAR probe Not I + EcoRI digest, lys-2 probe 
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FIGURE 5 (i) 

981 TTTGAAATTTATGTATATATCTGTAGCATTAGAAACTATAAGAGTTGTTA 1030 POtatO 

Ml II II III Mill I Mill III II III Mill Ml II Ml MM II 

40 TTTGAAATTTATGTATTTATCTATAGCATTAGAAACTATAAGAGTTGTTA 89 TOHiatO 
1031 GCTTCACTTGTCTTATTGTTGTGCTCAAAGCAACT . . . TCATCATACAGT 1077 

1 1 1 1 1 1 1 1 1 1 MM I II 1 1 II II II II 1 1 1 1 II IMMIMMM 

90 GC TTC ACTTGGC TT AC TGTTGTGC TCAAAGC AACTTC ATC ATC AT AC AGT 139 
1078 ATGGTTTTTATATGCTCTTCCATTATCACCGAACCTTATGATTATG . TGT 1126 

1 1 1 1 1| II 1 1 1 1 M t f I f 1 1 1 1 f 1 1 1 1 1 M M 1 1 1 1 1 1 1 M 1 1 I I ion 

140 ATGGTTTTGATATGCTCTTCCATTATCACTGAGCCTTATGATTATGTTTT 189 
1127 ACGAGCTTATAATATTACTGATGGTGATTCAGTATTATGATTATGTCCTC 1176 

Mill MM II MM MINI Mill Mill III II II III Mill 

■ 190 ACGAGCTTATAATATCACTGATGGTGATTCAGTATTGTGATTATGTCCTT 23 9 
1177 CATTAATTATTCTGTTTCATACAAGTCGTGTAATTTGCTGTTTGTGATTG 1226 

i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 i 

240 CGTTGATTATTCTGTTTCATACAAGTCGTGTAATTTGCTGTTTGTGACAG 289 

1227 TACGATAAATTGATTCAACCTTCTGCGGTGTTGGTTGAAGTTC AAGTAAA 1276 
II I || I I I I I I I I II M I I I II I I I I I M II I I I I I II I II II 
2 90 TACGATAGATCGACTCAACCTTCTGAGGTATTAGTTGAAGTTCATGTAAA 339 

1277 TTAGCTTTATTTATCATAGTAGCATTTGATTATTGATGCTCTGTAGCTAA 1326 

Mill II I I I II I I M II II II I I M I I M M I M M II M I II I II II 
340 TTAGCTTTGTTTATCATAGTAGCATTTGATTATTGATGCTCTGTAGCTAA 389 

1327 TGATAAGCCATTGAAGGGAAGCAGAAATGGTAAAGCTTTCTAAAATGAAT 1376 

Mill III HIM llllllll I III Mil I M I I I 1 M 

390 TGATAAGCCATTGGAGGGAAGC AAGCTTTCT . AAATGAAT 428 

1377 CTACGAATGGATGATAAAGTTAATGAATATTGTTGATACTTCTGCAATC A 1426 

HIM II llllllll llllll lllllllll III MIIIIMM Ml o 

429 CTACGAATGGATGATAAAGTTCATGAATATTTTTGTTACTTCTGCAGTCA 47 8 
1427 GATTATGAGTTACTGAGTCTACTG . TTTTTTAAGCCTGTTTCAGATGATC 1 4 7 5 

Ml llllllll II MM 1 1 II MUM lllllllll llllllll II 

479 GATC ATGAGTTATTGAGTCTATTGTTTTTTTAAGCCTGTTTC AGATGATC 528 

147 6 GATCATCAACAACAACATATTCAGTGTAGTAGACATGATCGATCACTTTC 1525 

II I II I I I M I II I I I I M M II I I I M I M I 
529 CATC ATCAGTAACAACATACACGGTGT AGT . . CCCAAATCCATCA 571 

1526 TAATTTTCGATTATGCACCCTCTTTTCTCCAATTTGGTC . . GTCTTCTTT 1573 

llllllll IMMIM MIIIIMM II II Ml cin 

572 TATGCACCTTCTTTTCTTCAATTTGGTCTTGTTTTTTTT 610 

1574 TTTTCATGATGTCACTGAATTATTCTCTGGTCGTCCCCACCATTCAGGAA 162 3 

-II M I M II II II I II 1 1 II IMMIM 

611 TTTTCATGATGTCATTGAATT ATTCAAGAA 640 

1624 GTCACTTCGAGCATAATG . . . TGAAAACATCCACATTT . TTCAA 1663 

Ml I I I I I I I I I MUM I II llllll Ml HIM 

641 GTCACTTCGAGCATAATGATTTTTCAAAATCCACCTTTGTTCAAGCACTA 690 

UQ406 
insertion 
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FIGURE 5 (ii) 

1664 ATCCAGC AGAATTTTC 1679 

III III MINIM! 
691 CCACGTCTTTTCATCTAGCCCACAACCGTGGTGGAGGATCTAGAATTTTC 740 

1680 ATCAAACGGGGTTCAACATTTAC . . . TACATGTATACACTCTGAAGTCTG 1726 

ii iii ii inn 1 1 in i i ii iiiiii ii i i i ii 00 

741 ATGAAA . . GGATTCAAAATTTACAAACATATATATACACTATACACTATG 788 
1727 AATCCACTAATTCTAGATGGTGCATCTGTGCCCCCACACTTGTGAAAGCT 1776 

1 1 II MM I II MINIMUM I III I Ml III I I 1 1 1 1 1 1 1 1 1 

789 AATCCACTAATACTAGATGGTGCACCTGTGCCCCCACTCATGTGAAAGCC 838 
1777 TATTCTCAATTTTTTATTTTCCAACAACTTGAATTCAGACCACACAACTC 1826 

III IIIIII MIIMI MM| Ml II iiiiii flS7 

839 TATTCTCAATTTTTTATTTTCC . ACAACTTAAATACAGACCGCACAACTC 887 
1827 CCGTGTCTTGT ACGGTCAGCATCTGAGTGGAGAACTCAA . . . . 1865 

I Ml III II 1 1 II III MM III Mill II 

888 CCGTGTCTTGTGTGCTCGTCGCTCAGCATGCAAGTCGAGAAAAGAAAGAC 937 

1866 * ' TTAAGTGACTTTAACG 1881 

II I I I I 1 I I 1 I I I 
938 CAAAACAATGAAAACTTTACGAAAAATCAAAAAGTTGAAGGACTTTAACG 987 

1882 TCGAGTTCTATAGTAAACAACCCCT ATATCTT 1913 

Mill III I III I MM II Ml II 

988 TCGAGATCTCTCGTAGAAAACCTCTTTTGTAAGGTTGCATACAATACTTT 1037 

1914 TTTTCAAGCATGTTAAGATTGCGAACAC ACTGA 1946 

MM II I Ml II I I MMI 

1038 TTTTTCAG . ACTTTACTTATGGTATTATACTGAATATGTTATTGCTGTTA 1086 
!947 . AATTTCC AGGTCGTTAATCTTGTACC 1972 

MUM II IIIIIIIIIIIMI 

1087 TAGTAGTTGAGTGACGTTTGAGGGAATTTCTAGTCCGTTAATCTTGTACT 1136 
1973 CAGTGTGTGTACTTTTAAAAAAAAAAGTCAGTTTTTTAGTCTCTAAAACA 2022 

iiiiMii iiiiii i ii mmmiMi ■ 

1137 CAGTGTGTCTACTTTT . . . CAAAAAAGTCAGTTTTTCAGTCTCTAAAACA 1183 
2023 CATTTAAAT . AGAGTTTATTTG . CCATCTTTTGTTCCTCATACTAGACTT 2070 

. IMIIIIII MIIMI MM IMIIIMIIIIIIIIM MM Ml 

1184 CATTTAAAT AAGAGTTTCTTTGCCCATCTTTTGTTCCTCATCCTAGGCTT 1233 

2071 CGGAGTCAACACAACACAACAACA 2094 

I I I I II I I I II M M I I I I I I I I 
1234 . GGAGTCAACACAAC ACAACAACA 12 56 




FIGURE 6 (i) 



i 
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101 
151 
201 
251 
301 
351 
401 
451 
501 
5S1 
601 
651 
701 
751 
801 
851 
901 
951 
1C01 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
19S1 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2S01 
2S51 
2901 
2951 
3001 
3051 
3101 



TGTTACAGAT TTGAAATTTA 
GAGTTCT7AG CTTCACTTGG 
TCATACAGTA TGCTTTTGAT 
TTATGTTTTA CGAGCTTATA 
TATGTCCTTC GTTGATTATT 
TTGTGACAGT ACGATAGATC 
TCATCTAAAT TAGCTTTGTT 
TGTAGCTAAT GATAAGCCAT 



CGACC-GCCCC GGCTGGTAAA TGCGGAAGCT 
TGTATTTATC T AT AGCAT 7 A GAAACTATAA 

cttactgttg tgctcaaagc aacttcatca 

ATGCTCTTCC ATTATCACTG AGCCTTATGA 

atatcactga tggtgattca gtattgtgat 
ctgtttcata caagtcgtct aatttgctgt 
gactcaacct tctgaggtat tagttgaagt 

JESESS acg^^x GATAAAGTTC 
SSfSJJ? TGCAGTCAGA TCATGAGTTA TTGAGTCTAT 

i^St^S SStottSc agatgatcca tcatcagtaa caacatacac 

S^SgTcC CAAATCCATC ATATGCACCT TCTTTTCTTC AATTTGGTCT 
SS^S <fTar TC AAT TATTCAAGAA O^CA^I^ 

<^taatg7t ttt«aaaat ccacctttgt tcaagcacta ccacgtctto 

?S^C CACAACCGTG G7GGAGGATC TACAATTJTC ATGAAAGGAT 

sasss ssssss sssss jsssss essss 

SS£5£§ aSSSSS SSgaccgc acaactcccg tgtcttgtgt 
SStcgct cagcatgcaa gtcgaqaaaa gaaagaccaa aacaatgaaa 
aaatcaaaaa gttgaaggac tttaacgtcg agatctctcg 
S^?SSag gttgcataca atacttttit ttcagacttt 
a££n£££ SSSaa tatcttattg ctgttatagt agttgagtga 
?g?S?aggg aatttctagt ccgttaatct tgtactcag? gtgtctact? 

TOAttMM TCAGTTTTTC AGTCTCTAAA ACACATT7AA ATAMMTTT 
cSScCCA- CTTTTGTTCC TCATCCTAGG CTTGGAGTCA ACACAACACA 
ACAACAATGA ATCTCCATTT TTCf GTTTCT TTACTTCTC? CTTTATCTCT 

ACAACAATCA Ai'^ CGGTGTTATT TCAGGTATCC ATCTCCAAAG 

TTTTCCTATG TATATGTATC TCTATGTTTA 
ATAAAGAAAA GTTAGTTTCT CTAGAATCTT 
TCAATTGGGA TTCGAGTAAT AAGCAAGGCG 
AACTTAGTTC CGGACTTGGC TAAAGCTGGA 
GTOACTCATG TTTGGTTGCC ACCATCATCT CACTCCGTTT CTCCTCAAGG 

SSSSgS agtgattgtg acctagtaat ccaatgaagj caaaa^cc 

»rrr»»riw> AOAG m CTAAA TTTTAATGAA AATAGTTCAG ACAAGTTAA, 

JSSaISSS tSStagtt caatccataa aatttgatgt agtagttaca 

GACCAACTTA TATO^AOi J. ^^^^ TTTTATCC CA GGTTATATGC 

GATGCTTCCA AGTTTGGAAA TCAGCAACAA 
TTTAACATGA CCACGGGATC AAATCGGTTG 
AGAACTGCTG ATAAC AAAG A TAGCAGGGGA 

esssse issess lilE 

rraqvprrvMAC TTTGAACCTG CACCTGATAT CGATCATCTT 
£££££ SgSSSgA GTTATCAGAC TOGATGAACT gC^AATC 

tgaaattgga tttgatggtt ggcgtttcga ttttgttagg ggatatgcac 
cttgcattac caaaatttat atgggaaaca cotccccgga ttttgctgtt 

r^TGAA^TGT GGAACTCTCT TGCTTATGGC CAGGACG3GA AACCGGAATA 

Sis esses ssees ssss? 

ESSE iE£5» ggg- gssSS 

AATATGTGGC CTTTCCCTTC AGACAAAGTT 
TCTTACTCAT CCAGGAATCC CATCCGTGGT 
TTCTACATAT CTCATIGTTT TCTATTTTAC 
CAGGGGATTT GAGAAACTCG GCCTGTGGGA 
TCGTAATCCA TAAACAAACA CTCAAACTCT 
CTCAACTCOT TTTTCACCGT GTTAATTGAA 
CACTTCAACT TACAAAATGA TCGJGTAGCA CC^CAAAAA ™X0TCTCK 
raATTAGCCA CGTGCGAGAT ACACGAAAAT uAGTTGGAiyi A ^^' 

rRAATAAAAC caagctgagg tgtctaaatg tgcacnctca aagtnggatg 
tt^St^ agctgSgcc gaggccatgt ttgantgtta tgcttatagg 

SIJSSS TTTGTTTCCG ATTAGCTGAG OANTTGATTA AATCCTNGTT 
SS?££ cStNCTTTG ATNGGGGCTN CNAGGATGGA 

ISagcac taamctctat taggaaaagg aataggattt GTGCANCAAG 



TCCTATGTTT GCCTCTTCGA 
AACCTTATTT' TTCTCTTAAC 
TGTAGTACI"? GCTCAAGTAT 
TGAATTCATT TGTTAGGGGT 
GATGGTACAA CTCTCTCATC 
GTTACTCATG TTTGGTTGCC 



in 

AAATGGAATT GCTTGAAGGC 
CAGGAAGGTT GTATGACTAG 
CTGAAAACTC ttattaaggc 
CTGATATAGT GATAAATCAT 
ATATACAGCA tctttgaagg 



TGGGATGATC GGTGTTTTGC 
ATGATACTGG ATCGACACAA 
ATGCAAGGAT ATGCATACAT 
AAAAAAAATA AATAAATTCT 
AAGAAATTTA TATTCTTTTC 
GTTTGCTCAC ATTGCCAGTC 
GAGTGTGCAC ATCTAGACAC 
CACTTCAACT TACAAAATGA 



XJQ406 

insertion 
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FIGURE 6 (ii) 



3151 
3201 
3251 
3301 
3351 
3401 
3451 
3501 
3551 
3601 
3651 
3701 
3751 
3801 
3851 
3901 
3951 
4001 
4051 



CWvTGTGCAA ATAATGGCTC CTGATTCTGA ATCTTTATAT ANCAATGGAT 
CA^SSaA TCATtScS GATTGCACCA AAACTTGATC TT-3GAAATCT 

SStcScct aatSSgagg tgccaacttc tggacaagac tatgctgtat 

GG6AGCAAAA GGCATAATCA TATTGTACCA CACTAAAAGG GACCATGGCC 
ACAATGGTTC TCATTAGTGT TAATGTTAI'A tgattgaaaa totaatttat 
££SSa tgaaggccaa aaattcaaga aattataaac aattcaatac 
£S£tca a^tcacaatt acattatgac ttctctattg caaactagtt 
tgStccaca ttattgtctc ctaaaatttt acaacatttc waagggaac 
tXawaott acagtgaaca tatgttcaaa ttacccttta tccccttaca 
Sgatttaa taaa^atttc ccctatccct ttggtagttg gttagagtta 
taagtaacgt agagattagt tataagagaa tttatgtatt attatccaga 
tctttagtta tatcgatttt agttatttat atgttgatta ?ttcaccttc 
Saatgcat ataaagatcg taaatgattg gattgatcga attcgaatga 
SiSSSSS gaactaatct tcaaatttaa tataaatttt ttttgtcaac 

SSSScC AAACGGCTCC AAAACAATAA ATAATTTACA TTTMKOT AG 
TATT-TATTT AAAATGGGAT NTTCCTCATC CCACTTGTAC CAGTTGAAAC 
CCTAATAATA AGCCAATCCA ACCGTCAAAA TTACAAATTT TQAAAATTGC 
GCTCC-JaCA GT7CTCCCCX ATTCAGATTT GATTCATTCT CTTCATTTT? 
SStocaS StScctct AAATCAACAA AATTCCCTTT GTTCAA&ISS Dem ATG 
— ^ rManrrrmk WtrmmBC I T T^MTTF CMCTfflgl 



- "Hi ^rilrA r ? *n™»™»ffi ^fflrft^flfflr M^MM* 



TjmnvinrA wtoctmm rwnppgre ^gMgg 

crAAAAcecc ^Twf ^ ^TWATGTTrr M<reAAAQrT 



^TTAAA r^ wrrwS re nyMWflflAA CA^PGrr.AAT TfTAAAJggfi 
wlngmSc rt rffiTTAAATC gggiflggsag 

i si ss ^ 

/cm Bry rTTvrrtAAA Arrgt ^TTf" aqprTrfflTflC TAAgATT^r W fi **™?C 

1™ S ttS^ r^;^ ^r™?: j*^™ 



si! gaassj s^r 



wip, , TOfl .AfM M7TAAGCTO wTfrffryy SSSS! 

Yd] £Z*<x rr* r^^AAA ^M^racc rrreaWr MC<MQW 

A Z\ SSSSSi j^ y^S^ y«ffrT»^ft MfflanrfflT TCflg^gc™ 

"on ™taS 7T y ^rm 1TAAOTOATT CTRGAATTCfl 

- g^ss ss S agaasgg: gsas 

cfpi .w^mtoa w»MflfWCT ^TfrTflrrTfl 

Bf ,«, ^cwctm wi<y^mac ^t^tt rK^nrr w^cg 



v^m^T cSV^a r^^^ isagg ggsga 



c-fc, a ^^^p^ rmtrrar,Mft TTPr.^Mg ^ ggg 



5501 fTA A^TAS ta^ n^"«^' gAgflTT^aTr CT1VGQ3GTC 

isssa ir^s s* t ^ 

. i mr rr^ ^ r ^rr rrg^g 



g < » HgSii^ f^V"^ '^^r .r^AAAC 

B r^rm^ fTfflCTT ffl T ^™TfT «rCTTOTO 



p^rfl^"^ ffrri"fl«a AArrrrrrn? attcacata? 

;™ ^gggs gsagr ffrfflmyy 



^F^y r rr^^ ^ 

C ff^i waayrwafafi WttAAffl yT ^ rA ' r ^ yrvrfl rflAfiA.AffrC g™ 1 **^ 

j amrr ^ fl r*rn>rrmr ggrAc rr ^ rrgTm;g[ ^ggrng 



59 o: 



acoca?™* TTAfiAfTT i'l T r « T*^Afl» ATTAirTGTCj rTCflfTACTAA 
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FIGURE 6 (iii) 

si or mr nt fflfl ft IM P™ 5 ™™ iraT rP""^ *rrrrrAr T rr CA*rer4£Cfl 
IS it aS j^ i mir i r TMTwmr rrecrraMa 
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